# Wikipedia training
**YugoGPT-Florida Q8_0 GGUF** · MIT · MarkoRadojcic · Downloads: 30 · Likes: 2
A Serbian large language model that performs strongly across multiple evaluation benchmarks.
Tags: Large Language Model, Other
**Tiny LM** · MIT · sbintuitions · Downloads: 6,626 · Likes: 2
A small language model with only 16 million parameters, used mainly for debugging and testing; supports English and Japanese.
Tags: Large Language Model, Transformers, Multilingual
**SimCSE Model XLMR** · Apache-2.0 · kornwtp · Downloads: 20 · Likes: 0
A sentence-transformers model based on XLM-R and trained with the SimCSE method; it maps sentences and paragraphs into a 768-dimensional dense vector space for tasks such as clustering and semantic search.
Tags: Text Embedding, Transformers
**SimCSE Model PhayaThaiBERT** · Apache-2.0 · kornwtp · Downloads: 123 · Likes: 2
A sentence-transformers-based model that maps sentences and paragraphs into a 768-dimensional dense vector space for tasks such as clustering and semantic search.
Tags: Text Embedding, Transformers
**SimCSE Model mBERT Thai Cased** · Apache-2.0 · kornwtp · Downloads: 25 · Likes: 1
A SimCSE model based on mBERT and trained specifically for Thai; it generates 768-dimensional vector representations of sentences and paragraphs.
Tags: Text Embedding, Transformers
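The SimCSE-style embedding models above all map text into a dense vector space that is then searched by cosine similarity. A minimal, self-contained sketch of that comparison step (the toy 3-dimensional vectors stand in for real 768-dimensional embeddings, which in practice would come from a sentence-transformers `encode` call):

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy stand-ins for 768-dimensional sentence embeddings.
query_vec = [0.2, 0.7, 0.1]
corpus_vecs = {
    "sentence A": [0.2, 0.6, 0.2],
    "sentence B": [-0.5, 0.1, 0.9],
}

# Rank corpus sentences by similarity to the query, as in semantic search.
ranked = sorted(
    corpus_vecs,
    key=lambda s: cosine_similarity(query_vec, corpus_vecs[s]),
    reverse=True,
)
# "sentence A" ranks first: its direction is closest to the query vector.
```

Clustering works the same way: the pairwise cosine similarities feed a standard clustering algorithm instead of a ranking step.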
**MiniLM L6 H384 Italian Cross Encoder** · MIT · osiria · Downloads: 328 · Likes: 1
An Italian text-ranking cross-encoder based on the MiniLMv2 architecture, with an embedding layer optimized for Italian.
Tags: Text Embedding, Other
**Abstract Sim Query** · biu-nlp · Downloads: 53 · Likes: 12
A model that maps abstract sentence descriptions to matching sentences, trained on Wikipedia with a dual-encoder architecture.
Tags: Text Embedding, Transformers, English
**Abstract Sim Sentence** · biu-nlp · Downloads: 51 · Likes: 16
A model that maps abstract sentence descriptions to matching sentences, trained on Wikipedia with a dual-encoder architecture.
Tags: Text Embedding, Transformers, English
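Both abstract-sim models use a dual-encoder design: one encoder embeds the abstract query, another embeds candidate sentences, and candidates are ranked by the similarity of the two vectors. A toy sketch of that retrieval pattern, with bag-of-words counts standing in for the two learned encoders (all data here is illustrative):

```python
from collections import Counter

def encode(text):
    # Toy "encoder": bag-of-words counts standing in for a learned
    # query/sentence encoder that would emit a dense vector.
    return Counter(text.lower().split())

def score(query_vec, sent_vec):
    # Dot product of the two encoders' outputs, as in dual-encoder retrieval.
    # Counter returns 0 for absent tokens, so only shared tokens contribute.
    return sum(query_vec[tok] * sent_vec[tok] for tok in query_vec)

query = encode("a city located on a river")
candidates = [
    "Paris lies on the banks of the Seine river",
    "The model was trained on Wikipedia",
]

# The best candidate shares the most (weighted) features with the query.
best = max(candidates, key=lambda s: score(query, encode(s)))
```

Because queries and sentences are encoded independently, the sentence vectors can be precomputed once and reused for every query, which is the main practical advantage of dual encoders over cross-encoders.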
**SBERT Large Cased PL** · Voicelab · Downloads: 327 · Likes: 7
SHerbert-large is an improved Sentence-BERT model built on the Polish HerBERT model; it generates semantically meaningful sentence embeddings that can be compared with cosine similarity.
Tags: Text Embedding, Other
**Multilingual BERT GN Base Cased** · MIT · mmaguero · Downloads: 28 · Likes: 0
A Guarani-focused language model fine-tuned from the multilingual BERT base model, which supports 104 languages including Guarani.
Tags: Large Language Model, Transformers, Other
**Dansk GPT Wiki** · flax-community · Downloads: 14.89k · Likes: 4
A Danish GPT2-style model trained with the Flax CLM pipeline on the Danish portion of the wiki40b dataset.
Tags: Large Language Model, Other
**BERT Base Mongolian Cased** · tugstugi · Downloads: 258 · Likes: 0
A pre-trained Mongolian BERT model, trained on Mongolian Wikipedia and news datasets, supporting Mongolian text-processing tasks.
Tags: Large Language Model, Other
**SimCSE Model Distil mBERT** · mrp · Downloads: 21 · Likes: 0
A sentence-transformer model based on m-Distil-BERT and trained with the SimCSE method; it maps text to 768-dimensional vectors for semantic search and clustering tasks.
Tags: Text Embedding, Transformers
**Bertinho GL Small Cased** · dvilares · Downloads: 56 · Likes: 2
A pre-trained BERT model for Galician (6 layers, case-sensitive), trained on Wikipedia.
Tags: Large Language Model, Other
**Swe GPT Wiki** · flax-community · Downloads: 24 · Likes: 3
A Swedish GPT2-style model trained with the Flax CLM pipeline on the Swedish portion of the wiki40b dataset.
Tags: Large Language Model, Other
**Dbert** · baikalai · Downloads: 17 · Likes: 0
A Korean pre-trained language model based on the BERT architecture, suitable for Korean text-processing tasks.
Tags: Large Language Model, Transformers, Korean
**SimCSE Model mBERT Thai Cased** · mrp · Downloads: 1,637 · Likes: 7
A Thai sentence-embedding model based on mBERT and trained with the SimCSE method on Thai Wikipedia data; it maps text to 768-dimensional vectors.
Tags: Text Embedding, Transformers
**BERT Base Thai UPOS** · Apache-2.0 · KoichiYasuoka · Downloads: 53.03k · Likes: 1
A BERT model pre-trained on Thai Wikipedia text for POS tagging and dependency parsing.
Tags: Sequence Labeling, Transformers, Other
**BERT Base IT Cased** · Apache-2.0 · Geotrend · Downloads: 15 · Likes: 0
A streamlined custom version of bert-base-multilingual-cased, optimized for Italian while preserving the original model's accuracy.
Tags: Large Language Model, Other
**DistilBERT Base En Vi Cased** · Apache-2.0 · Geotrend · Downloads: 30 · Likes: 1
A compact version of distilbert-base-multilingual-cased, tailored to English and Vietnamese while preserving the original model's accuracy.
Tags: Large Language Model, Transformers, Other
**BERT Base En Ja Cased** · Apache-2.0 · Geotrend · Downloads: 749 · Likes: 0
A compact version of bert-base-multilingual-cased, focused on English and Japanese while preserving the original model's representational capabilities.
Tags: Large Language Model, Other
**DistilBERT Base En Fr Da Ja Vi Cased** · Apache-2.0 · Geotrend · Downloads: 25 · Likes: 0
A lightweight version of distilbert-base-multilingual-cased supporting English, French, Danish, Japanese, and Vietnamese while preserving the original model's accuracy.
Tags: Large Language Model, Transformers, Other
**DistilBERT Base Ur Cased** · Apache-2.0 · Geotrend · Downloads: 157 · Likes: 0
A lightweight version of distilbert-base-multilingual-cased, optimized for Urdu while preserving the original model's accuracy.
Tags: Large Language Model, Transformers, Other
**DistilBERT Base En Zh Cased** · Apache-2.0 · Geotrend · Downloads: 29 · Likes: 1
A compact version of distilbert-base-multilingual-cased, tailored to bilingual tasks in English and Chinese while preserving the original model's accuracy.
Tags: Large Language Model, Transformers, Other